143 research outputs found

    Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V

    Full text link
    In this paper, we critically evaluate the capabilities of the state-of-the-art multimodal large language model, i.e., GPT-4 with Vision (GPT-4V), on Visual Question Answering (VQA) task. Our experiments thoroughly assess GPT-4V's proficiency in answering questions paired with images using both pathology and radiology datasets from 11 modalities (e.g. Microscopy, Dermoscopy, X-ray, CT, etc.) and fifteen objects of interests (brain, liver, lung, etc.). Our datasets encompass a comprehensive range of medical inquiries, including sixteen distinct question types. Throughout our evaluations, we devised textual prompts for GPT-4V, directing it to synergize visual and textual information. The experiments with accuracy score conclude that the current version of GPT-4V is not recommended for real-world diagnostics due to its unreliable and suboptimal accuracy in responding to diagnostic medical questions. In addition, we delineate seven unique facets of GPT-4V's behavior in medical VQA, highlighting its constraints within this complex arena. The complete details of our evaluation cases are accessible at https://github.com/ZhilingYan/GPT4V-Medical-Report

    Enhancing Building Semantic Segmentation Accuracy with Super Resolution and Deep Learning: Investigating the Impact of Spatial Resolution on Various Datasets

    Full text link
    The development of remote sensing and deep learning techniques has enabled building semantic segmentation with high accuracy and efficiency. Despite their success in different tasks, the discussions on the impact of spatial resolution on deep learning based building semantic segmentation are quite inadequate, which makes choosing a higher cost-effective data source a big challenge. To address the issue mentioned above, in this study, we create remote sensing images among three study areas into multiple spatial resolutions by super-resolution and down-sampling. After that, two representative deep learning architectures: UNet and FPN, are selected for model training and testing. The experimental results obtained from three cities with two deep learning models indicate that the spatial resolution greatly influences building segmentation results, and with a better cost-effectiveness around 0.3m, which we believe will be an important insight for data selection and preparation

    Evaluation of oral Lanzhou lamb rotavirus vaccine via passive transfusion with CD4+/CD8+ T lymphocytes

    Get PDF
    AbstractLanzhou Lamb derived Rotavirus (RV) Vaccine (namely LLR) for children is only used in China. Since there were no reports on evaluation of LLR, even the data of phase IV clinical trial, we proceed the evaluation of LLR through focusing on T-cell to investigate whether LLR could induce the potential function involving in protection as a vaccine. Four groups of nude mice were transfused with CD4+/CD8+ T-cells isolated from LLR-immunized (primed) and LLR-unimmunized (naïve) mice via intraperitonea (i.p.) respectively. Consequently, the adoption mice were challenged with mice-origin wild rotavirus EDIM (Epizootic Diarrhea of Infant Mice) by intragastric administration. Series of fecal/serum samples were collected and viral shedding, then serum IgA/IgG and secreted IgA were assayed. Compared to the mice transfused with T lymphocytes from naïve mice, the nude mice transfused with CD4+ T lymphocytes from primed mice induce fecal and serum IgA increasing more rapidly, and have a shorter duration of virus shedding too. Whereas, no significant difference in virus clearance was found between the mice transfused with CD8+ T lymphocytes isolated from primed and naïve mice. Therefore, we cleared the distinct roles of transfused CD4+/CD8+ T lymphocytes for rotavirus clearance in nude mice, that the viral clearance conducted by CD4+ T lymphocytes. Meanwhile, it has ability to help induction of LLR specific immunogenicity. Comparing with the transfusion of cell from primed and naïve mice, LLR can induce CD4+ T lymphocytes memory which is a potential index to reflect the immunogenicity and protection, while CD8+ T lymphocytes remove rotavirus by CTL with little memory ability

    CellMix: A General Instance Relationship based Method for Data Augmentation Towards Pathology Image Classification

    Full text link
    In pathology image analysis, obtaining and maintaining high-quality annotated samples is an extremely labor-intensive task. To overcome this challenge, mixing-based methods have emerged as effective alternatives to traditional preprocessing data augmentation techniques. Nonetheless, these methods fail to fully consider the unique features of pathology images, such as local specificity, global distribution, and inner/outer-sample instance relationships. To better comprehend these characteristics and create valuable pseudo samples, we propose the CellMix framework, which employs a novel distribution-oriented in-place shuffle approach. By dividing images into patches based on the granularity of pathology instances and shuffling them within the same batch, the absolute relationships between instances can be effectively preserved when generating new samples. Moreover, we develop a curriculum learning-inspired, loss-driven strategy to handle perturbations and distribution-related noise during training, enabling the model to adaptively fit the augmented data. Our experiments in pathology image classification tasks demonstrate state-of-the-art (SOTA) performance on 7 distinct datasets. This innovative instance relationship-centered method has the potential to inform general data augmentation approaches for pathology image classification. The associated codes are available at https://github.com/sagizty/CellMix

    Preparation of fluorescence-encoded microspheres in a core-shell structure for suspension arrays

    Get PDF
    Fluorescence-encoded microspheres are widely used in the detection and analysis of biological molecules, especially in suspension arrays. Here, we report an efficient strategy for the preparation of fluorescence-encoded polystyrene microspheres with desirable optical and surface properties. The micron-sized, monodisperse polystyrene seed beads were first synthesized by dispersion polymerization. Then, dye molecules and carboxyl functional groups were copolymerized on the surface of the seed beads by forming a core-shell structure. Rhodamine 6G (R6G) was used as a model dye molecule to prepare the fluorescent beads, and the fluorescence intensity of the beads can be precisely controlled by adjusting the quantity of R6G. These fluorescent beads were characterized by environmental scanning electron microscopy, laser scanning confocal microscopy, and spectrofluorometry. The differences of the fluorescence spectra between fluorescent beads and R6G in solution were investigated. Twelve kinds of fluorescent beads encoded with different R6G fluorescence intensities were prepared, and they can be clearly distinguished on a conventional flow cytometer. Furthermore, the encoded beads are stable in water and resistant to photobleaching, which is crucial for their potential applications in diagnostic assays and imaging. Detection of human alpha fetoprotein antigen via a sandwich microsphere-based immunoassay yielded a detection limit of 80 pg mL(-1), demonstrating that the fluorescence-encoded microspheres synthesized herein are efficient in serving as the microcarriers in suspension arrays. As both the encoding and functionalizing procedures are made simultaneously, the newly designed technique is extremely simple and time-saving. Moreover, it could be readily applicable to the preparation of a wide size range of fluorescent particles made by polymerization.National Natural Science Foundation of China [20675070]; Program for New Century Excellent Talents in University [NCET-07-0729]; NFFTBS [J0630429]; Scientific Research Foundation ; State Education Ministr

    Research on the derated power data identification method of a wind turbine based on a multi-Gaussian-discrete joint probability model

    Get PDF
    This paper focuses on how to identify normal, derated power and abnormal data in operation data, which is key to intelligent operation and maintenance applications such as wind turbine condition diagnosis and performance evaluation. Existing identification methods can distinguish normal data from the original data, but usually remove power curtailment data as outliers. A multi-Gaussian-discrete probability distribution model was used to characterize the joint probability distribution of wind speed and power from wind turbine SCADA data, taking the derated power of the wind turbine as a hidden random variable. The maximum expectation algorithm (EM), an iterative algorithm derived from model parameters estimation, was applied to achieve the maximum likelihood estimation of the proposed probability model. According to the posterior probability of the wind-power scatter points, the normal, derated power and abnormal data in the wind turbine SCADA data were identified. The validity of the proposed method was verified by three wind turbine operational data sets with different distribution characteristics. The results are that the proposed method has a degree of universality with regard to derated power operational data with different distribution characteristics, and in particular, it is able to identify the operating data with clustered distribution effectively

    MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image Segmentation

    Full text link
    The Segment Anything Model (SAM), a foundation model for general image segmentation, has demonstrated impressive zero-shot performance across numerous natural image segmentation tasks. However, SAM's performance significantly declines when applied to medical images, primarily due to the substantial disparity between natural and medical image domains. To effectively adapt SAM to medical images, it is important to incorporate critical third-dimensional information, i.e., volumetric or temporal knowledge, during fine-tuning. Simultaneously, we aim to harness SAM's pre-trained weights within its original 2D backbone to the fullest extent. In this paper, we introduce a modality-agnostic SAM adaptation framework, named as MA-SAM, that is applicable to various volumetric and video medical data. Our method roots in the parameter-efficient fine-tuning strategy to update only a small portion of weight increments while preserving the majority of SAM's pre-trained weights. By injecting a series of 3D adapters into the transformer blocks of the image encoder, our method enables the pre-trained 2D backbone to extract third-dimensional information from input data. The effectiveness of our method has been comprehensively evaluated on four medical image segmentation tasks, by using 10 public datasets across CT, MRI, and surgical video data. Remarkably, without using any prompt, our method consistently outperforms various state-of-the-art 3D approaches, surpassing nnU-Net by 0.9%, 2.6%, and 9.9% in Dice for CT multi-organ segmentation, MRI prostate segmentation, and surgical scene segmentation respectively. Our model also demonstrates strong generalization, and excels in challenging tumor segmentation when prompts are used. Our code is available at: https://github.com/cchen-cc/MA-SAM
    • …
    corecore